NIKHEF-2013-011, CP3-13-14 



o 

u 

< 

m 
(N 

Ph. 

P.: 

CD 



> 

o 



X 



Unravelling tth via the matrix element method 

Pierre Artoisenef^, Priscila de Aquino'', Fabio Maltoni'^, Olivier Mattelaer'^-'^ 

" Nikhef Theory Group, Science Park 105, 1098 XG Amsterdam, The Netherlands 

' Theoretische Natuurkunde and IIHE/ELEM, Vrije Universiteit Brussel, 

and International Solvay Institutes, Pleinlaan 2, B-1050 Brussels, Belgium 

'^ Centre for Cosmology, Particle Physics and Phenomenology (CP3), 

Universite catholique de Louvain, Chemin du Cyclotron 2, B-1348 Louvain-la-Neuve, Belgium 

'^Department of Physics, University of Illinois at Urbana- Champaign, 1110 West Green Street, Urbana, IL 61801 

(Dated: April 25, 2013) 

Associated production of the Higgs boson with a top-antitop pair is a key channel to gather further 
information on the nature of the newly discovered boson at the LHC. Experimentally, however, 
its observation is very challenging due to the combination of small rates, difficult multi-jet final 
states and overwhelming backgrounds. In the Standard Model the largest number of events is 
expected when h — >■ 66, giving rise to a W^W~bbbb signature, deluged in if+jets. A promising 
strategy to improve the sensitivity is to maximally exploit the theoretical information on the signal 
and background processes by means of the matrix element method. We show how, despite the 
complexity of the final state, the method can be efficiently applied to discriminate the signal against 
combinatorial and ii+jets backgrounds. Remarkably, we find that a moderate integrated luminosity 
in the next LHC run will be enough to make the signature involving both H^'s decaying leptonically 
as sensitive as the single-lepton one. 



I. INTRODUCTION 

Evidence for the recently discovered new heavy boson 
to be the long-sought-for Higgs particle of the Standard 
Model is already quite compelling [T, I2!] . Rates and dis- 
tributions are compatible with the predictions of a scalar 
particle coupling to other SM particles with a strength 
proportional to their mass. The current sensitivities and 
accuracies of the golden production-decay modes, how- 
ever, are not sufhcicnt to draw a final answer on the 
strength and the structure of the couplings without ad- 
ditional hypotheses. Other channels need to be investi- 
gated. 

Prominent among the yet-to-be-explored production 
modes is the tth associated production. The main in- 
terest of this channel stems from the fact that the rate is 
manifestly proportional to the square of the SM Yukawa 
coupling to the top quarks. In addition, more differen- 
tial observables could bring information on the coupling 
structure pS] and on the Higgs parity [4]. This chan- 
nel, however, is notoriously difficult for several reasons. 
The first is that production rates at hadron colliders are 
quite small due to the need of a large cms collision en- 
ergy for the initial partons, strongly suppressed by par- 
ton distribution functions. Next-to-leading order calcu- 
lations 0-0] predict a SM rate of 0.137 pb and 0.632 
pb with 0(10%) uncertainty at the LHC for -y/s ~ 8 
and 14 TeV, respectively. Current searches mainly fo- 
cus on the dominant decay mode h ^>- hh and there- 
fore on a W~^W~bbbb final state, other decays, such 
as /i -J> W+W- ^, h ^ T+T- [I and eventually, 
h ^- jj [i3] , being much rarer demand larger integrated 
luminosities. The second reason is that the W'^W~bbbb 
signature is affected by two different types of challenging 
backgrounds. On the one hand ti + light- or heavy-flavor 
jets because of the enormous rates, and on the other hand 



the intrinsic combinatorial background that stems from 
the difficulty of correctly identifying out of four 6-jets 
the two from the Higgs decay. Finally, the complexity 
of the final state makes its kinematic reconstruction not 
straightforward mainly due to finite jet energy resolution, 
missing energy and the ubiquity of extra QCD radiation. 

Due to the above intrinsic difficulties, the prospects of 
first using this channel for discovery or just for observa- 
tion have been constantly deteriorating as more accurate 
predictions and simulations were available to the LHC 
community. More recently, the attention on this channel 
was revived by Plehn et al. [il| who suggested that while 
drastically lowering the rates, boosted tops and Higgs in 
the final state would make the combinatorial background 
much less severe, improving the significance S/\B of the 
SM Higgs observation at large enough luminosities. 

In this work we argue that the sensitivity to tth can be 
also enhanced at low px, i.e., where the bulk of the cross 
section resides, by means of the matrix element reweight- 
ing method, improving the prospects for observation of 
this channel at the LHC in the coming years. The matrix 
element method is able to efficiently reduce the combina- 
torial problem for the single-lepton final states and even 
more for the di-lepton final state, bringing the two to a 
comparable level of sensitivity already for moderate in- 
tegrated luminosities. 



II. THE MATRIX ELEMENT METHOD 

The matrix element reweigthing method (MEM), origi- 
nally introduced in Ref . [l3] , assigns probabilities to com- 
peting hypotheses, e.g., signal vs. signal-|-background, 
given a sample of experimental events. The most at- 
tracting feature of this method is that it makes maximal 
use of both experimental information and the theoretical 



model, associating a weight to each event based on the 
value of the matrix element {i.e., the scattering ampli- 
tude) for that specific final state configuration for each of 
the hypotheses. While very simple in its essence, in prac- 
tice several technical and conceptual challenges arise and 
different level of simplifications are commonly employed. 
The method, implemented using matrix elements calcu- 
lated at the leading order, has been successfully applied 
to a number of key results in collider physics : from the 
most precise top mass determination [Tj, [ij] , single top 
observation [1^ [ig] at the Tevatron, to the Higgs boson 
discovery and characterization at the LHC [ITI, 13 • Ef- 
forts to include next-to-leading QCD corrections, at least 
for simple final states, have started |19l.l20l|. 

In this work, wc define the weight associated with an 
experimental event x given a set of hypotheses ct as 



P{x\a) 



1 



d^y)\M^\''{y)W{x,y), (1) 



where \Ma\'^{y) is the leading-order matrix element (giv- 
ing the parton-level probability), d^{y) is the phase- 
space measure, (including the parton distribution func- 
tions fi{qi)dqi and f2{q2)dq2) and W{x, y) is the transfer 
function which describes the evolution of (the final state) 
parton-level configuration in y into a reconstructed event 
X in the detector. The normalization by the total cross 
section (Ja in Eq. ([T]) ensures that P{x\(y.) is a probabil- 
ity density, ^ P{x\a.)dx = 1, if the transfer function is 
normalized to one. 

As evident from the definition in Eq. ([T|), the cal- 
culation of each weight involves a non trivial multi- 
dimensional integration of complicated functions over the 
phase space. The problem of computing the weights for 
arbitrary models and processes was tackled in Ref. [21[ 
by implementing a general algorithm in a specifically de- 
signed code named Mad Weight. We stress that very 
fact of automatically, reliably and quickly calculating 
weights for challenging final states as those involved in tth 
has never been achieved before. It is a significant tech- 
nical result on its own that provides key evidence on the 
generality and flexibility of the Mad Weight approach. 

One of the main limitations of the method is that the 
matrix elements are considered at the leading order only 
and therefore extra QCD radiation effects must be han- 
dled in some effective way. In our study we are inclu- 
sive on extra transverse radiation and consider only the 
hardest jets to be matched with the corresponding par- 
tons in the matrix element. The transverse momentum 
of these partons (including isolated leptons) is assumed 
to be balanced against the transverse momentum of extra 
radiation in the event 221. 



III. TECHNICAL ASPECTS 

Parton-level events for sigrial and backgrounds are gen- 
erated with MadGraph 5 [23], passed to Pythia 6 |23| 



process 


incl. a 


efficiency 


^rcc 


tih, single-lepton 
tth, di-lepton 


111 fb 
17.7 fb 


0.0485 
0.0359 


5.37 fb 
0.634 fb 


ti+jets, single-lepton 
tt-l-jets, di-lepton 


256 pb 
40.9 pb 


0.463 X 10"^ 
0.168 X 10"^ 


119 fb 
6.89 fb 



TABLE I: Total cross sections at the LHC 14 TeV and corre- 
sponding efficiency factors of the applied selection. 



for showering and hadronization, employing the MLM- 
kx merging procedure [25| . Detector response simulation 
is performed using Delphes 2 [2^], with the input pa- 
rameters tuned to the values associated with the CMS 
detector. Pile-up effects have not been considered. 

Only the dominant background tt-|-jets is taken into 
account, and is modeled by generating parton-level ti 
processes with up to two extra partons in the 5-flavor 
scheme. For the signal, the parton-level processes tih and 
tih + 1 parton are considered. Inclusive samples for the 
signal and the background have been normalized to the 
total cross section at NLO from [¥\ and [27| , respectively. 
Spin correlation effects in the decays of the tops, which 
for signal shapes are more important than NLO QCD 
corrections [2g|, have been retained. 

The event selection procedure is modeled after that 
adopted by the CMS collaboration for the measurement 
of the ti cross section in the di-lepton channel [29!]. 
Single-lepton (di-lepton) events are required to have at 
least one (one pair of opposite-charge) lepton(s). Only 
isolated electrons or muons are lepton candidates in our 
analysis. They are required to have a transverse momen- 
tum pt > 20 GeV and a pseudo-rapidity ]?7] < 2.4. 

Jets are reconstructed via the anti-fcT algorithm (with 
a cone radius R = 0.5) as implemented in FastJet [30] 
and applied on the calorimeter cells fired by the gener- 
ated stable or quasi-stable particles. Jet candidates are 
required to have pr > 30 GeV and \ri\ < 2.5, and not 
to overlap with any selected leptons. At least four b- 
jets are required. They are identified with an efficiency 
£(, = 0.7 while mis-tag rates Ec = 0.2 for charm quarks 
and ej = 0.015 for fight partons are used. The cross sec- 
tions for signal and backgrounds together with the final 
efficiencies of the adopted minimal selection procedure 
are collected in Table H] 

Only transfer functions for the jet energies are taken 
with a finite resolution, which we parametrize through 
a double-Gaussian shape function characterized by five 
independent parameters. For each of them, an energy 
dependence ci -I- c^yE -I- c-^E is used. The constants Ci 
are determined from an independent ti sample where well 
separated jets (including light and &-jets) are matched 
to the corresponding partons. The typical resolution 
for the jet energy is between 5 and 12 GeV, with tails 
parametrized by Gaussians of width as large as 30 GeV. 
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FIG. 1: Left: Normalized distributions of events with respect 
to the MEM-based observable D for the di-lepton (top) and 
single- lepton (bottom) channels. Right: Efficiency of selecting 
signal vs. background using a D > Dm™ cut. 



IV. RESULTS 

For a generic event i with kinematics Xi 
based observable D,- is defined as follows: 



D, 



Pix,\S) 



Pix,\S) + Pix,\B) 



tfie MEM- 



IS) 



Expected (normalized) distributions of signal and back- 
ground events with respect to this observable are named 
Ds and Db, and are shown in Fig. [1] (left). The plots 
show that for the same number of signal events the MEM- 
based observable delivers a higher discriminating power 
in the case of the di-lepton channel. This is manifest in 
the right-hand plot of the same figure where the e^ ver- 
sus eb efficiencies resulting from a cut on the observable 
D > Drain are shown. This may seem surprising at first 
sight, given that the di-lepton channel is characterized 
by two missing particles in the final state, against only 
one in the single-lepton channel. However, the di-lepton 
channel is much cleaner, with only 6-jets required in the 
final state, a lower probability of erroneously including 
extra QCD radiation and, eventually, a more manage- 
able combinatorial background. 

In order to assess the significance that can be achieved 
at the LHC y/s = 14 TeV for a given luminosity £, 
we consider a large number of pseudo-experiments, each 
with a number of events set to iV = a^J^ L (with a""^^ the 
reconstructed cross section, see Table U last column) . In 
the B-only hypothesis, the number of signal and back- 
ground events are set to s = and b — N. In the S + B 
hypothesis, s and b are generated under the constraint 
s + b — N according to the product of Poisson distribu- 
tions with mean values NsQ/^SQ + bo) and iV6o/(so + ^0)1 
respectively. Here so and 60 are the expected number of 
reconstructed events after rescaling the signal cross sec- 
tion by a parameter ^, i.e. bo = a^^ C and .sq — 11 Cg?^ C. 
For each event, the corresponding Di value is generated 
according to the probability law Ds (in the case of a sig- 
nal event) or Db (in the case of a background event) 
shown in Fig. [TJ This procedure is used to generate 
10^ pseudo-experiments under each hypothesis (B-only 
or S-fB) at a given luminosity C. 

For each pseudo-experiment the likelihood ratio L^ is 
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FIG. 2: Left: Log likelihood profiles in the case of the di- 
lepton channel, assuming a luminosity of 32 fb~^ at 14 TeV 
and setting fi = 1 (SM cross section). Right: Expected upper 
bound on the tih cross section (in units of SM cross section) 
at 95 % C.L. 



calculated as follows: 

N 



-R 



n 

i 

N 

n 



roP{x,\S) + {l-ro)P{x,\B) 
P{x,\B) 

roA + (l-ro)(l-A) 

(1-A) 



(3) 



with ro = so/(so + bo). The resulting B-only and 
S + B distributions of pseudo-experiments with respect 
to In {L^) are shown in Fig. [2] (left) in the case of the 
di-lepton channel, with C —32 fb~^ and n — 1. The 
two distributions are shifted towards positive values of 
In {L^) , which indicates that the MEM weights do not 
exactly describe the phase-space distributions of back- 
ground and signal events. This bias originates from the 
approximations inherent to the calculation of the weights, 
e.g., the assumed parametrization of the transfer function 
and the effective treatment of beyond- leading-order QCD 
radiations. 

By smearing the value of bo according to a log-normal 
distribution (mean=6o, std=0.26o) before generating s 
and b in each pseudo-experiment, we also verified that 
systematic uncertainties on the background normaliza- 
tion have a negligible impact on the distributions of 
pseudo-experiments with respect to In (L^) ■ On the 
other hand, already a 20% uncertainty on 69 hampers 
a counting analysis based on the number of events to be 
available at LHC, unless s/b 3> 0.2. 

We repeat this exercise with different values of /i until 
the median of the i?-only distribution cuts 5% of the left- 
hand tail of the S + B distribution. Such a value of /i 
provides us with the estimate /i x a {tih) of the expected 
upper bound on the signal cross section at 95 % C.L. in 
the absence of signal. Fig. [5] (right) shows our estimate 
of the parameter ^ as a function of the luminosity £, 
separately for the di-lepton and single-lepton channels. 
We observe that the sensitivity achieved in the di-lepton 
channel is slightly better than the one in the single-lepton 
channel at large luminosities. 



V. CONCLUSIONS 

The matrix element reweighting method is a power- 
ful technique to enhance the sensitivity of searches and 
measurements at colliders. In this work we have applied 
it to the observation of Higgs production in association 
with tt pair, in particular to final state signatures involv- 
ing the decay of the Higgs to bottom quarks. First, we 
have verified that the general algorithm implemented in 
Mad Weight provides the possibility of automatically, 
reliably and quickly calculating weights for final states 
as complex as those featured in tih. This technical re- 
sult, by itself, is an important one as it opens the door 
to applications of the MEM to a much wider set of pro- 
cesses and analyses than what has been done so far. Sec- 
ond, we have applied the method to tih with both one- 
and two-lepton final states. We have found that the di- 
lepton final state, though penalized by a smaller number 
of expected events and possibly more difficult to recon- 
struct due to the presence of two neutrinos in the final 
state, becomes competitive with the single-lepton chan- 
nel already after a moderate integrated luminosity. We 
reckon that this result, while based on MC simulations, is 
rather robust and encourage more refined investigations. 
For instance, relaxing the number of requested 6-tags to 
three in the di-lepton final state would bring a signifi- 
cant increase in the statistics, yet not of the combinato- 
rial background, leading to a further relative gain with 
respect to the single-lepton final state. Analogously, the 
inclusion of pile-up will impact more the signature with 



the largest number of jets in the final state. Only fully- 
fiedged experimental analyses can assess the final gains, 
and correctly include the systematic uncertainties that 
have been neglected here. 

In conclusion the search for SM tih and in particu- 
lar the di-lepton final state is a perfect illustration of 
the power of the matrix element method in providing 
additional leverage in difficult analyses. Further inves- 
tigations concerning the possibility of using the matrix 
element method in tth to access more detailed informa- 
tion on the structure of the couplings of the new boson 
or in other very challenging production channels, such at 
thj [3l|, are foreseen. 
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